3 research outputs found

    EXPLAINABLE MODELS FOR MULTIVARIATE TIME-SERIES DEFECT CLASSIFICATION OF ARC STUD WELDING

    Get PDF
    Arc Stud Welding (ASW) is widely used in many industries such as automotive and shipbuilding and is employed in building and jointing large-scale structures. While defective or imperfect welds rarely occur in production, even a single low-quality stud weld is the reason for scrapping the entire structure, financial loss and wasting time. Preventive machine learning-based solutions can be leveraged to minimize the loss. However, these approaches only provide predictions rather than demonstrating insights for characterizing defects and root cause analysis. In this work, an investigation on defect detection and classification to diagnose the possible leading causes of low-quality defects is proposed. Moreover, an explainable model to describe network predictions is explored. Initially, a dataset of multi-variate time-series of ASW utilizing measurement sensors in an experimental environment is generated. Next, a set of pre-possessing techniques are assessed. Finally, classification models are optimized by Bayesian black-box optimization methods to maximize their performance. Our best approach reaches an F1-score of 0.84 on the test set. Furthermore, an explainable model is employed to provide interpretations on per class feature attention of the model to extract sensor measurement contribution in detecting defects as well as its time attention

    Truveta Mapper: A Zero-shot Ontology Alignment Framework

    Full text link
    In this paper, a new perspective is suggested for unsupervised Ontology Matching (OM) or Ontology Alignment (OA) by treating it as a translation task. Ontologies are represented as graphs, and the translation is performed from a node in the source ontology graph to a path in the target ontology graph. The proposed framework, Truveta Mapper (TM), leverages a multi-task sequence-to-sequence transformer model to perform alignment across multiple ontologies in a zero-shot, unified and end-to-end manner. Multi-tasking enables the model to implicitly learn the relationship between different ontologies via transfer-learning without requiring any explicit cross-ontology manually labeled data. This also enables the formulated framework to outperform existing solutions for both runtime latency and alignment quality. The model is pre-trained and fine-tuned only on publicly available text corpus and inner-ontologies data. The proposed solution outperforms state-of-the-art approaches, Edit-Similarity, LogMap, AML, BERTMap, and the recently presented new OM frameworks in Ontology Alignment Evaluation Initiative (OAEI22), offers log-linear complexity in contrast to quadratic in the existing end-to-end methods, and overall makes the OM task efficient and more straightforward without much post-processing involving mapping extension or mapping repair

    Gene selection for optimal prediction of cell position in tissues from single-cell transcriptomics data

    No full text
    Single-cell RNA-sequencing (scRNAseq) technologies are rapidly evolving. Although very informative, in standard scRNAseq experiments, the spatial organization of the cells in the tissue of origin is lost. Conversely, spatial RNA-seq technologies designed to maintain cell localization have limited throughput and gene coverage. Mapping scRNAseq to genes with spatial information increases coverage while providing spatial location. However, methods to perform such mapping have not yet been benchmarked. To fill this gap, we organized the DREAM Single-Cell Transcriptomics challenge focused on the spatial reconstruction of cells from the Drosophila embryo from scRNAseq data, leveraging as silver standard, genes with in situ hybridization data from the Berkeley Drosophila Transcription Network Project reference atlas. The 34 participating teams used diverse algorithms for gene selection and location prediction, while being able to correctly localize clusters of cells. Selection of predictor genes was essential for this task. Predictor genes showed a relatively high expression entropy, high spatial clustering and included prominent developmental genes such as gap and pairrule genes and tissue markers. Application of the top 10 methods to a zebra fish embryo dataset yielded similar performance and statistical properties of the selected genes than in the Drosophila data. This suggests that methods developed in this challenge are able to extract generalizable properties of genes that are useful to accurately reconstruct the spatial arrangement of cells in tissues
    corecore